Universal Access To All Knowledge
Home Donate | Store | Blog | FAQ | Jobs | Volunteer Positions | Contact | Bios | Forums | Projects | Terms, Privacy, & Copyright
Search: Advanced Search
Anonymous User (login or join us)
Upload

Reply to this post | Go Back
View Post [edit]

Poster: Administrator, Curator, or StaffTomBombadil Date: Feb 21, 2014 9:05am
Forum: texts Subject: OCR Help

By what means would it be possible to extract/download a text file of an existing OCR'd newspaper I have uploaded, to improve (as in completely retype important articles) and upload to improve the overall quality of certain texts?

If this is not possible, is there a certain step that I'm somehow skipping before I upload the newspaper to the internet archives, that I can go back to in order to rewrite the newspaper before the OCR takes place on the internet archives.

I realize this second option would require me to run the Acrobat OCR, but I'm worried even if I rewrite the article, the Internet Archive OCR will revert back to the original crapped up version.

Reply to this post
Reply [edit]

Poster: Administrator, Curator, or StaffTomBombadil Date: Feb 21, 2014 9:44am
Forum: texts Subject: Re: OCR Help

Okay so I've searched past topics and read that this is possible, however I'm unsure of where I upload the corrected OCR, via edit item>Edit Files>Upload new file, or as a completely new item entirely--and link the new file via item review.

The latter way troubles me, as many people I feel overlook the reviews on the items, however, I'm unsure if I follow the first option, what I have to delete, in order so the corrected .txt file is used.